Yang Zheng

Design of an Expression Recognition Solution Employing the Global Channel-Spatial Attention Mechanism

Mar 15, 2025
Via arXiv

Solution for 8th Competition on Affective & Behavior Analysis in-the-wild

Mar 14, 2025
Via arXiv

Interactive Multimodal Fusion with Temporal Modeling

Mar 13, 2025
Via arXiv

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation

Mar 13, 2025
Via arXiv

GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling

Mar 13, 2025
Via arXiv

Road Traffic Sign Recognition method using Siamese network Combining Efficient-CNN based Encoder

Feb 21, 2025
Via arXiv

PLPP: Prompt Learning with Perplexity Is Self-Distillation for Vision-Language Models

Dec 18, 2024
Via arXiv

AIpparel: A Large Multimodal Generative Model for Digital Garments

Dec 05, 2024
Via arXiv

CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization

Sep 11, 2024
Via arXiv

RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models

Aug 27, 2024
Via arXiv